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Abstract 

For a large class of expanding maps of the interval, we prove that par- 
tial sums of Lipschitz observables satisfy an almost sure central limit theo- 
rem (ASCLT). In fact, we provide a rate of convergence in the Kantorovich 
distance. Maxima of partial sums are also shown to obey an ASCLT. The 
key-tool is an exponential inequality recently obtained. Then we estab- 
lish (optimal) almost-sure convergence rates for the supremum of moving 
averages of Lipschitz observables (Erdos-Renyi type law). This is done 
by refining the usual large deviations estimates available for expanding 
maps of the interval. We end up with an application to entropy estima- 
tion ASCLT's that refine Shannon- McMillan-Breiman and Ornstein- Weiss 
theorems. 

1 Introduction and results 

Almost sure central limit theorems were first derived for independent sequences 
of random variables by Brosamler [6 , Schatte [25] and Fisher [TH] (see also [55]). 
In its simplest form the result states that if X\, X2, ... is an i.i.d. sequence of 
real random variables with zero mean and unit variance (and satisfying some 
adequate moment condition), if (S n ) is the sequence of partial sums 
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then almost surely, for any real x 
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where D n = Ylk=i k -1 = log"- + Notice that this refines the following 

result by Erdos-Hunt [TB] which can be seen as a discrete version of the strong 
arc sine law: 
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provided that the distribution of X\ is symmetric. The logarithmic average in 
([l]) may look surprising at first glance. However, the Erdos-Hunt result already 
implies that the Cesaro mean does not converge with probability one. We refer 
to [3J for a more general discussion showing that the logarithmic average is 
essentially the only one that works. 

After the initial discovery, a large literature was published extending the 
result in various directions. We refer to [3], [1] and pQ for references and to [32] 
for a generic result working for aperiodic dynamical systems. 

Results for sequences of dependent random variables have also been obtained 
under some mixing conditions (see [3J and references therein). However dynam- 
ical systems do not often satisfy these mixing conditions with respect to the 
most natural observables in their phase space. We will consider below the fam- 
ily of dynamical systems given by piecewise expanding maps of the interval. We 
assume that the map / is a piecewise monotonic transformation with b branches 
and we denote by (aj)' =0 the corresponding subdivision of [0, 1]. We also as- 
sume that / is monotonic and extends to a C 2 map on each Ai = [o,-, Oi+i]. 
Finally we assume that the map is topologically mixing and there is a constant 
T] > 1 and an integer m such that for any x S [0,1), \(f m )'(x)\ > rj. Under 
these conditions this dynamical system has a unique absolutely continuous in- 
variant measure dfi = h dx [30] satisfying exponential decay of correlations for 
functions of bounded variation [23] . For such observables one also has a central 
limit theorem [23J . It is therefore natural to investigate the truth of almost sure 
central limit theorems in this class of systems which are rather well understood. 
We expect that the techniques developed below will prove useful in more gen- 
eral situations. P and E will refer to the probability and the expectation with 
respect to fi. 

We will prove below a slightly stronger form of the almost sure central limit 
theorem ([1} which is formulated using convergence in the Kantorovich distance 
k (we refer to |42j for equivalent definitions and properties of this distance) . We 
will denote by C the set of Lipschitz functions with Lipschitz constant equal to 
one and vanishing at the origin. We recall that if v\ and v 2 are two probability 
measures, k is given by 

k(vi,V2) = sup / g(x) d{yx - u 2 )(x) . (2) 

g£C J 

Note that since v\ and v 2 are probability measures, in the above integral we 
can replace g by g — g(0), or in other words there is no restriction in assuming 
g(0) = 0. It is convenient to define the sequence of weighted empirical (random) 
measures of the average by 

1 ™ 1 
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where S a is the Dirac measure at point a of the real line. We investigate the con- 
vergence of this random measures to a Gaussian distribution in the Kantorovich 
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metric. 

We now state our first main result. 



Theorem 1.1 Consider a piecewise C 2 expanding map f of the interval which 
is topologically mixing and let d/i = h dx be its unique absolutely continuous 
invariant probability. Consider the sequence of random variables Xj = u o f ' J 
where u is a Lipschitz function with zero fj, average, and assume that the quantity 
a 2 given by 
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a 2 = / u 2 dn + 2 V / 
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u o f J u d/i (4) 
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is non zero. Then Lebesgue almost surely 

lim K (£„,Af(0,o- 2 )) = , (5) 

n — >oo 

where 7V"(0, er 2 ) is the Gaussian measure with mean zero and variance a 2 . More 
precisely, there is a constant Cq > such that, for all n > 3, 




>1 < 



(logj) 1 / 3 ^,-^^ 2 )) 
Vloglogj 

g-Codogn) 1 / 3 _ 

Notice that the last estimate in the theorem provides an upper bound to the 
velocity of the approximation by the Gaussian measure. 

It follows from the proof that there exist constants C\ > and Ci > such 
that for any n > 3 and for any t > 



U£ n ,mO,a 2 ))>t + C 2 ^^ 



< 2 e 



-C!t 2 D„ 



This is a kind of large deviation bound. In the independent case, and for the 
almost sure weak convergence large deviation, estimates have been obtained in 

It follows from the decay of correlations that the quantity a 2 is finite and non 
negative (see [33]). We also recall that since 7V(0, a 2 ) has a bounded density, 
convergence in the Kantorovich metric implies convergence in the Kolmogorov 
metric and in other topologies, see e.g. [H] for a review. We recall that the 
convergence in the Kantorovich metric follows from the weak convergence and 
the convergence of the integral of the function g{x) = \x\. However we will 
handle directly the Kantorovich metric in order to get estimates on the speed 
of convergence. 

Define now the measure Q (a) by 

dG{a){x) := -?0-0(x) e- x2/2a dx, 

V7TC7 

where *d is the Heaviside function. In the following theorem, S* = sup fc<n Sk- 
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Theorem 1.2 Under the assumptions of theorem ] 1. 11 the sequence of random 
measures 

D n ^ k 



Mn = —Y^ T Js t , (6) 
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converges Lebesgue almost surely to Q{u) in the Kantorovich metric. 

The same result holds for the minima of partial sums inffc<„ Sk- Velocity of 
convergence estimates could also be obtained. 

In order to state the Erdos-Rcnyi Theorem, we recall the large deviation 
result for expanding maps of the interval [Sj . Let u be a function of bounded 
variation on [0, 1]. Without loss of generality, we assume that E(it) = 0. Suppose 
that a > where a is defined by formula (U|). 

There exists (3q > such that for any \/3\ < (3o the following limit exists 

F(fi) = lim -]ogZ n (fi) 

n — >oo 77, 

where 

Z n {p) = ( e p ^=« u(p(x)) dii{x) . 



For \(3 1 < Po the function F is analytic (see [5]), and we denote by ip its Legendre 
transform. Recall that the function ip is analytic for |a| < ao, where ao > is 
small enough. Moreover tp is non negative and strictly convex and its minimum, 
which is equal to zero, is attained at the /i-expectation of u, that is 0. In 
particular, for any fixed < a < ao, and any e > 0, there is an integer tiq — 
no(e) such that for any integer k > no we have 

e -k{ 9 {a)+e) < p / 1 o f j > a } < e -k(v{<*)-*) . (7) 

We will need a much sharper result given in Appendix A. Let 
A/ fe = sup S k o f . 

0<j< [ exp(ktp(a))] -k 

Theorem 1.3 Assume that u is function of bounded variation of [0, 1]. Then, 
under the hypotheses of Theorem ] 1. 11 there is a number a* > (a* < ao) such 
that for any |a| < a* , we have Lebesgue almost surely 

M k -ka 1 

limSUp : ; < — 

k^oo log k 2/3 

and 

.. . . Mk-ka 1 
hmmi — : ; — > — — — 

k~>oo log k 2p 

where (3 = ip'(a). 
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Note that this implies 



lim — — = a Lebesgue almost surely . 



Remarks. 

1. We refer to [H] for an optimal estimate in the case of independent random 
variables. We notice that we get the same rate of convergence. It follows from 
the estimate below that one can derive estimates on the rate of convergence, 
however these estimates depend on quantities like the derivative of the pressure 
function F which are not easily controlled in terms of data on the map /. 

2. As observed by Comets [13] in a different context (Ising model of Statistical 
Mechanics on a lattice), the Erdos-Renyi Theorem can be used as a statistical 
tool to determine the large deviation function. As a first application, we recall 
that for full Markov expanding maps of the interval, the essential spectral radius 
of the transfer operator is related to the function F which is the inverse Legendre 
transform of tp (see [TT] and for extensions [55] and [TO])- In other words, this 
Theorem provides a statistical tool to estimate a generic lower bound on the 
decay of correlations. Other applications of the Erdos-Renyi Theorem will be 
discussed in forthcoming publications. 

3. It follows from results in [7] and [33j that, since we assume that our map is 
topologically mixing, the density h of the absolutely continuous invariant mea- 
sure jjL is bounded below away from 0. Another sufficient condition is provided 



The rest of the paper is organized as follows. In section 2 we prove theorems 
11.11 and 11.21 We first derive an estimate on the expectation of the Kantorovich 
distance. We next apply an exponential inequality proven in [12] which allows 
to control the deviations of the Kantorovich distance from its expectation. The 
result follows from a Borel-Cantelli type argument. The Erdos-Renyi Theorem 
is proven in section 3 using a precise large deviations estimate of independent 
interest which is discussed in the appendix. In Section 4 we deduce ASCLT's 
refining both Shannon-McMillan-Breiman and Ornstein- Weiss theorems. The 
latter estimates the entropy of the system by the recurrence rate of typical 
trajectories. 

In the sequel, C, C±, etc, will denote various constants whose value may vary 
with the context. 

2 Almost sure convergence for Lipschitz observ- 
ables 

We will say that a real- valued function K on [0,1]™ is separately Lipschitz in 
all its variables, if for any 1 < i < n we have 



in0. 



sup 



K(. 



n 




< oo (8) 



Xi-y 



■5 



where in the above notation the indices out of range are absent. 
We recall some inequalities proven in [12j . 

Theorem 2.1 There is a constant C > such that for any integer n and for 
any real valued function K on [0, 1]™ separately Lipschitz in all its variables, we 
have 

' e K ( x,.,r-V))-EW d/i(a .) < e cnu h ^f , (9) 



where ~E(K) is the average 

E(K)= J K{x,---,f n - 1 {x))d^x) 



We have also 



>(\K(x,---J n - 1 (x))-E(K)\>^ <2e- t2 /( 4C ^ L ^ 2 ) 



(10) 



Recall (see for example [42]) that the Kantorovich distance is also given by 
At(£„,AA(0,a 2 )) = / \T n {x) -F a (x)\ dx 



where T n and F a are the laws of £„ and A/"(0, a 2 ) respectively, or in other words 

(i? is the Heaviside function.) 
We will first prove that 

lim E U(£ n , Af(0,o- 2 ))) = . 

n— >oo 

In fact, we will prove a stronger result estimating the speed of convergence to 
zero which is useful for the second part of Theorem 11.11 

Proposition 2.2 Under the hypotheses of Theorem \l.l\ there is a positive con- 
stant C such that for any n > 3 we have 

^ COoglogn) 1 /? 



E («(£„, wV(0, cr 2 ))) < 



(logn) 1 ^ 



Proof. First introduce the following notation for convenience: A n — CVlog log n, 
where C is a large positive constant to be fixed later. Introduce also the fol- 
lowing sequence of functions: ipn( x ) = ip(x/A n ) where ip is a non-negative C 2 
function bounded by one, equal to for \x\ > 2 and to 1 for \x\ < 1. 
It is easy to check that 

E(K(£ n ,M(0,a 2 ))) < (1 + 0(1)/A n ) E ( sup / g d(E n - Af(0, a 2 ))) + 

\geCJA n J 

(1 + 0(1)/A n ) E (sup / " g d(£ n - M(0, a 2 ))) + 

\geCj-oc J 
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E( sup / gi> n d{£ n -N{0,<r 2 )) 



Since g(0) = we have 
E (sup [ g d(£ n - W(0, a 2 ))) < E ( [ x d£ n (x) 



x dAf(0,a 2 )(x) 



The last integral is bounded above by e °^ A ^ for n large enough. On the 
other hand, an integration by parts leads to 



k—l 

Since the function 



El^j x d£ n (x)J < 



k=l 



K(x u 
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is separately Lipschitz with Lipschitz constants all equal to L(u), it follows from 
(fTU)) that there are two positive constants C\ and Ci such that for any integer 
n and any t > 



(11) 



(12) 



W\\S q \ > t + E(\S q \) \ < c ie - C2t2/q . 

It is proved in 23J that for any q > 1 

E(\S q \)<E(S 2 ) 1/2 <0(l)Vq. 
We now choose C large enough such that for any q > 1 and n > 3 

A»Vg-E(|S,|)>^. 
Using (fTTjl with t — x^fq — E(|5 g |) for x > A n , and inequality (fT2|) . we obtain 

A n P(5 g > A n ^q) + / P(S 9 > x^q) dx < 0{l)e~°^ A ™ 



and therefore 



(sup r g d(£ n ~JV(0,a 2 ))) < 0(l)e-°^ A ~ . 

\gGCJA n J 



Similarly 



E 



E sup 

\g£C J — oo 



d(£ n -Af{0,a 2 )) <0(l)e 



-0(l)Al 



We now handle the middle range integral 

E(sup / giP n d(£ n -N(0,a 2 )) 
\gecJ 
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We introduce a sequence of finite sets C n of Lipschitz functions defined for 
n > 3 and with Lipschitz constant at most 2. This is a set of functions g defined 
on [— 2A n — l,2A n + 1] which are piecewise affine on each interval between 
consecutive points of (Z(log n) -1 / 3 ) n [—2A n — 1, 2A n + 1], which are zero at the 
origin and satisfy for any integer k € (logn) 1 ' 3 [— 2A n — l,2A n + 1] 

ff ((fc + l)(log7 1 )- 1 / 3 )-« ? (A ; (logn)- 1 /3) e (log n )-i/3 (zn [-2,2]). 

Finally, one takes g constant on the intervals 

[ - 2A n - 1, -[(2A n + lJClognJ^Oogn)- 1 ^] 

and 

[[(2A n + lJOogn^lOogn)- 1 / 3 ]^ + l] • 
It is easy to verify that for any I € £, there is a g G C n such that 

sup \g(x) - t[x)\ < 3(logn)- 1/3 . 

xe[-2A n -\,2A n +l] 

By a simple computation one gets 

Card(£ n ) < 5 2 ( 4 ' 4 "+ 3 )( 1 °g") 1/3 . (13) 

This estimate is related to the e-entropy of £, see [27] . [34] . 
We therefore have 

E (sup / ip n g{d£„ - dAf (0, a 2 ))) < 

KgeC J -oo / 

fifsup / + ^ n g{d£ n -dN{0,a 2 ))]+O{l)A n {\ogn)- 1 l 3 . 

\geC„ J-oo / 

We now use Pisier's inequality (see [13]) to estimate the integral on the right 
hand side. We get 

E(e (log 11)2/3 ^ ^9d(£ n -Af(o^))^ ^ 

The factor (logn) 2 / 3 in the exponent will be convenient to balance later on the 
different bounds and, in particular, the 'entropy' contribution (|13p . We have 

/+oo 
VV? d£ n = K n (x,f(x),---,f n -\x)) 
-oo 

where 

K n (xi, ■ ■ ■ ,X n ) = V -(V'nff) ( ^ P ) 

and this function is separately Lipschitz with 
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uniformly in n and in g G C n . We now apply Jensen's inequality to the left 
hand side of (pB|) and the exponential inequality @ to each term of the right 
hand side together with (TT5|) and we obtain 



e (logn) 2 / 3 E(sup ge£ri i> n gd(E n -M(0,a 2 ))) 

< 5 2(4A„+3)(logri) 1 / 3 e O(l)(logri) 4 / 3 Z)-y^^ _ 

(15) 

We now take the logarithm on both sides and divide by (logn) 2 / 3 . We obtain 
E ( sup / ijj n gd{E n - Af(0, a 2 ))) < 

\g£C n J -oo / 

° ii]A " 1 sup e( ^ ip n9 d(£ n -N{0,a 2 )) 



(logn)V3 



gec n 



We now estimate the last term in the right-hand side of this inequality. 

Let Vk be the law of S k / V~k. Since g G C n is differentiable except in 
Z(logn) -1 / 3 we have 

SO /"OO 

i> n gdv k < I dx(ip n g)'(x)v k ((-ao,x]) + 

DO J — OO 

J2 AMq^ogny^i^gXqilogn)- 1 / 3 ) , 

| ? |<(2A„ + l)(log«)i/3 

where 

Av k (s) = i/ fc ((-oo, s]) - i/ k ((-oo, s~]) . 

To estimate Ais k , we will use the Berry-Esseen inequality proved in our context 
by A. Broise [5]). She indeed proved that 



sup 



K[o{x-^=) J -/vi,) 



< 0{l)k 



-1/2 



uniformly in k > 1. Since the Gaussian has a bounded density, we have uni- 
formly in k 

sup A^(g(logn)~ 1/3 ) < Oil)^ 1 / 2 

| 9 |<(2A„+l)(logn)V3 

therefore uniformly in n and fe 

/oo 
Vvg d7V(0, a 2 ) + Oi^A^lognY^k- 1 ' 2 . 
-oo 

From ([3]) we have 

E ( / lp n gd£ n ) = ~PT y ' f / V'nff <^fc 



/OO \ 71 i PCX) 

*l> n gd£ n = — J] - / V«£ 
-oo / fc=1 ft J-oo 



Then it follows easily that uniformly in n > 1 and g G £„ we have 

/ /*°° \ /*°° 

El / i; n gd£ n )= i> n g(W{0, a 2 ) + 0(l)A n (\og n)" 2 / 3 . 

\«/ — CO / J — oo 
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Using this estimate in (fT5|) , the proposition follows. 



We now handle the proof of Theorem ll.il In order to apply the exponential 
inequality, we consider the sequence of functions 



K n (x 



1 , . . . , Jb n I 



dx 



n 

— y U 

D n ^ k 

" fc=l 



Eh 
1=1 x l 



After an easy computation one gets for any 1 < p < n 

0(1) 



L p (K n ) 



< 



We can now apply the exponential inequality Q to get the existence of a con- 
stant C\ > such that for any n and for any t > 



K(S n , Af(0, a 2 )) - E (/s(£ n , AA(0, a 2 ))) 



> * < 2e 



-Cit 2 D„ 



This implies the first part of Theorem 11.1 
We define the sequence (n^) by 



n k — e 



We conclude from the above estimate and Proposition 12.21 that for a positive 
constant 9 large enough 



eqoglognfc) 1 / 2 
(logn fc ) 1 /3 



< oo 



which implies by the Borel-Cantelli lemma that the sequence n(£n k , A/"(0, tr 2 )) 
converges to zero almost surely. More precisely 



fc^logn) 1 / 2 



]T PU(£„ fc ,AA(0,a 2 )) > 



eqoglognfc) 1 / 2 
(logn fc ) 1 /3 



< 



0(1) Yl e ~° (1)l <0(l)e-° (1)(logn) 

<'>(logri) 1 /3 

From ([21) it follows that for n > nu 



1/3 



A. 



1 n 1 



D n A — ' J 

71 j=n fc +l J 



/V7- 



This implies 



K (£„,AA(0,a 2 )) - K (£ nk ,JV(0,a 2 )) \ < 



-^.(£„ fc ,AA(0, ( r 2 ))+sup^- £ ~ 



g[%) I gdM(0,a 2 
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From now on, we assume that n < rife+i, then the first term tends to zero almost 
surely by our previous estimates and is more precisely ©((logn^) -1 / 3 ). We now 
prove that the second term tends to zero almost surely. We have 



1 

sup — 



E 

j=n k +l 



Si 

VJ 



S 4 - gdM(0,a 2 ) 



<- y - 

, n k+1 

— T - 



VJ 



\x\ dAf(0,a 2 )(x) 



< 



VJ 



+ / |x| dAf(0,a 2 )(x) 



We have for any k 

"fc+i 

D~ 



j=n k +l 

Using (fT2ll . we get 



— > - / £dA/(G,<r )(x) < 0(1) : 



1 5,- 



1 ^ 1 



<0(1) 



logn fc+ i - logn fc 
log rife 



We now observe that the function 



% ^hi) 71 - X 7" 



is separately Lipschitz with the following estimates, for k > 4: 

f o(i) for 1 < q < n fe + 1 



L q {K) < 



jqD n forrife + 1 < g < nfe+i. 



Therefore 



9=1 



Dn k+ i Dn k 

D 2 

n k 



Using (|10[) and the choice of rife , one easily gets for k > 1 



(sup ( (fogn,) 1 ^! £ 1 JM ) > (log 



logrz£) 1/2 < 



-C (Iogn f ) 1/3 



f>fc 



for some positive constant Co- 

Collecting all the above estimates finishes the proof of Theorem 11.11 
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The proof of Theorem 11.21 is similar to the proof of Theorem 11.11 Using the 
exponential inequality dTUJ) , it follows as before that 

P(\K(M n ,g(s))-E(K(M n ,g(s)))\>t) <2e-°^' D " . 



We have also 

S„ > 



+ E(|5*|)^ < 0{\)e- ^ u2 ' n . 



Let us now estimate the expectation. First note that E (\S* |) < E (sup 1<:)< „ \Sj |) . 
We use Pisier's inequality [33] to get 



l-SjlVlogi/V" 



E ^e suPl <j'<" ISjIViogn/V^^ < E ^< 

Taking logarithm of both sides, using Jensen's inequality in the left hand side 
and inequality @ in the right hand side we get 



E(\S* n \) <0(\)^n\ogn. 
Therefore, for any r\ > we have 



oo , 

71 = 1 ^ 



(logn) 1 / 2 ^ ) < oo 



We now deal with the expectation. 
Proposition 2.3 Under the assumptions of Theorem ] 1. 11 we have 

lim E( K (M n ,g(a))) = . 

n — >oo 

Proof. As before we have for any A > large enough and uniformly in n 



E I / dx 

>\x\>A 



< er O(l)A 2 



The middle integral is treated as before except for the term E (/ t/j^g dM n ) 
appearing in the analog of (fT5|) . 

It is enough to show that for any g € C n 

lim E((ip A g)(SZ/Vk)) = [ ^ A gdQ(o) . 



At this point we recall the almost sure invariance principle (ASIP) [T5], [23] . 
There is a positive number d and an enriched probability space f2 carrying also 
a Brownian motion (B t ) such that 

Sk — o~Bk = o(fc~ 5+1 / 2 ) eventually almost surely 

where the sequence (Sk) has the same joint distribution as (Sk)- In other words 
there is an almost-surely finite integer- valued random variable N(uj), where 
ui € 0, such that, for any j > N(u) 

|S>)-a5>)|<j-* +1/2 . 
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To derive the ASIP for (S'l) = (sup 3<fc Sk), notice that there are two cases for 
each k, namely the supremum is attained for an index jk < N(u>) or for an index 
jk > N(oj). We claim that for almost all u) the first case can occur at most for 
finitely many k's. Indeed, if it was not the case, since (£?£) is non decreasing, it 
would imply that this sequence is bounded. However, by the law of the iterated 
logarithm (for Bk) the sequence (Sk) diverges almost surely. In other words, the 
index for which the maximum is attained in S% is eventually almost surely larger 
than N(cu). The same argument holds for the sequence (Bj). We conclude that 
the ASIP also holds for the sequence (SI)- In particular 



Vk Vk 



almost surely 



Since sup,,, gR \(gip A )(x)\ < 2 A, we can use the dominated convergence theorem 
to conclude that for any A > 

lim (E((^ A g)(S-i/Vk))-E{(yj A g)(aBl/Vk))) = . 

Define B^* = sup 1<J<fc Bji^. Now observe that by rescaling we get 

E{(ij A g)(aBl/Vk))=E{(^ A g)(aBl*)) . 

Since trajectories of the Brownian motion are almost surely Holder continuous 
PU] . we obtain 

B%* — sup B t — > almost surely . 

0<t<l 

Using again the dominated convergence theorem we get 

lim E((^ A g)(aB* k *)) =E((^ A g)(a sup B t )) 

k->oo 0<t<l 

The proof of Theorem 1 1.21 is complete by letting A tend to infinity and by using 
the explicit expression for the law of sup 0<t<1 B t , see |20j . 

3 Proof of Theorem D 

From Lemma I A. 11 stationarity and the differentiability of y?, we have 

[cx.p(kip(a.)] — k 

F(M k > ka + x log k) < F ( S k ° P > ka + x log k) 

j=o 

< 0(l)e ktp ^ k^ 1 ^ 2 e^ kv ^ a+ ^ xl ° sk ^ k ^ < 0(l)k~ l t 2 e~ xip ' (°^ Xo&k 

and this is summable over k if x > l/(2ip'(a)) = 1/(2/?). We are done with the 
upper bound. 

For an integer r to be chosen later on independently of k and < e < 1 also 
independent of fc, we define a new quantity 

Mk = SUp SUp S k O prk+l[k^] 

0<]< [(exp(ksp(a))-k)/(rk)] EKZ^Ifc 1 - 6 / 4 ] 
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The gaps of size rk in the indices will allow us later on to use the decay of 
correlations. The small gap [fc e / 4 ] is chosen for convenience. It can be reduced 
to a large enough constant times logfc. Its role is to ensure that the probability 
of having simultaneously Sk(x) > ka and Sk o ^'^{x) > ka is small enough. 
We have of course 

M k < M k 

and therefore for any A 

P(M k < A) < P(M k < A) . 
We will now estimate this last quantity with A = ka — (1 + e) logfc/ (2/3). Let 
A = {x : S k {x) < ka - (1 + e)(logfc)/(2/3)} . 



We have 



E 



(M fe <fca-(l + e)(logfc)/(2/3)) = 
/ \ 

jrk+l[k e/4 ] 



II XW 

0<j< [(exp(fa^(a))-fc)/(j-fc)] 
o<i<[fc 1_€ / 4 ] 



/ 



In order to estimate this quantity, we will construct an upper bound as follows. 
Let tp be the non negative Lipschitz function defined by 

{1 for x < 

1 - x for < x < 1 
for 1 < x. 

We have obviously 

XA( x )<^(Sk(x)-ka+(l + e)(logk)/(2(3))<X B (x) 

where 



(16) 



Therefore 



B = {x : S k (x) <fca-(l + e)(logfc)/(2/3) + l} 



M k < ka- (l + e)(logfc)/(2/3)) 
/ \ 

jrfe+;[fe e/4 



< E 



n 

0<j<[(cxp(fc V ( Q ))-fe)/(rfc)] 



/ 



It is easy to verify that if v is a function of bounded variation and w is 
a Lipschitz function with Lipschitz constant K w we have with V denoting the 
total variation 

y{w ou)< K w V v . 

We recall (see [33]) that under our hypothesis on the transformation /, there is 
a positive constant A > 2 such that for any integer q we have 



yf q < A q 
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Therefore, if we define the function g\ by 



9l (x)= J] ^(sJf^(x)) -ka+(l + e)(logk)/m) , 
one gets easily 

Vgi < k 2 A 2k . 

We recall (see [23 ) that there exist two positive constants C and p < 1 such 
that if gi is a function of bounded variation and g 2 is integrable with respect 
to the Lebesgue measure (and hence also with respect to fj,), we have for any 
integer p 

J 9i 92 ° f p dp - J g\dp J g 2 dfi < Cy ( V g x + J \gi \ dx^j J \g 2 \ dx . 

We now apply this inequality with p = (r — l)k and 

(x)=g 2 , q (x)= J] tp{s k (f rk + l ^ /4 \x))-ka+(l + e)(logk)/(2/3)) 



<)-> 



o<i<?-i 

0<l<[k 1 - e / 4 ] 



q being an integer. We obtain the recursive bound for any q>l 

Jdp(x) H ^(^(/^ +i [ fe£/4 l(x))-fca + (l + e)(logfc)/(2/3) 



0<j<q 
0<l<[k 1 - e / 4 ] 



< (J ' 9l dp + Ck 2 A 2k p^ k 
J dp{x) \{ tf, (s k (P rk+l ^(x)) -ka+(l + e)(log £0/(2/3) 



o<i<?-i 

0<K[fc 1 - e / 4 l 



We now choose r (depending on a) such that 21og^4 + (r — l)logp < —tp(a). 
This implies for k large enough 

P(M k < ka- (l + e)(logft)/(2/3)) 

/ r \ [(o X p(fe V (a))-fc)/(rfc)]-l 

< I / gi dp + 0{l)e- klp{a) - kS \ (17) 

for some 5 > independent of k. 

We now have to estimate the integral of g\ from above. We have of course 
from (US) 

f i[k e/i ] 



/ 9i{x)dp< dp Y[ XB°f 

= [dp n (i - x B c ° / 
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Using Bonferoni's inequality, we get 

gi(x)dn<i- ]T [ d »XB° f l[k£/i] 

+ E /«w n[ * e/4] xw ro[fe£/4] • m 

0<m=£n<[k 1 -< L / i ] 

We observe that for fixed < e < 1 and for k large enough, we have from 
Lemma (jA.lj) 

fi(B c ) > 0(1) e -*v(<*) . 

Therefore using also the invariance of the measure /Lt, the opposite of the second 
term in the right hand side is bounded below by 

([fc 1 - 6 / 4 ] + l)n{B c ) > 0(1) fci+«/4 e -M°0 . (19) 

We now have to estimate the last term on the right hand side of (p~8|) and show 
in particular that it is much smaller than the modulus of the second term (for 
large k). This estimate is provided in the appendix by Lemma lA. 21 Collecting 
all the bounds, namely (fT7|) . (JTHJ) and (I19p . we finally get 

V(M k < ka-(l + e)Qogk)/(20)) < 0{l)e- c ^ e '^ 

where C > is some positive constant. The right hand side of this estimate 
is summable in k for any e > and the result follows using the Borel-Cantelli 
Lemma. 



4 Applications to entropy estimation 

We define n-cylinder sets as usual: A\" :— A il n f~ 1 A i2 n ■ ■ • H f~ n+1 A in , where 
the Ai's are the monotonicity /regularity intervals of the map /. Denote by 
V n the set of n-cylinders. For all x £ [0, 1] which is not the n-th preimage of a 
discontinuity point, there is a unique n-cylinder containing x, denoted by V n {x). 
Since / is expanding, the partition of [0, 1] into the sets ]cij, o»+i] generates the 
Borel (T-algebra. 

We assume that log|/'| is a Lipschitz function. Throughout this section, 
the observable u will be log|/'|. Recall that by Rokhlin formula [31], h fJi (f) = 

P»<a r '//'• 

The following theorem is a refinement of Shannon-McMillan-Breiman The- 
orem (see also [24], [46] for the usual central limit theorem). Let us define the 
sequence of weighted empirical (random) measures of minus the logarithm of 
the /z-measure of cylinders by 

1 ™ 1 

SMB n = — ^2 - 5(_ logM cp fc (.))_ M) /Vfe 
™ fe=i 

where h = h^{f). 
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Theorem 4.1 Under the hypotheses of Theorem \1.1\ and assuming that the 
function log \f'\ is Lipschitz, 

lim k (SMB n ,M(0, a 2 )) = , (20) 

n — >oo 

Lebesgue almost surely. 

Proof. By Theorem 11.11 applied to u — log |/'| — h and the triangle inequality, 
we only have to prove that n(SA4B n ,£ n ) — > as n goes to infinity, Lebesgue 
almost surely, where £ n is given by ((3]). 

We have the following strong approximation: for any < g < 1 and for any 
x in a set of measure > 1 — g, there is an integer N — N(g) such that for any 
k > N 

S k (x) - 0(1) < -log»(V k (x)) < S k (x)+0(1) . (21) 

Indeed, let 

B(k,C) = [p G V k : Vx e P ± < KP Q \ < C 
{ C exp-S k (x) 

Then, by Lemma 22 in [38:, given any < q < 1, there exists T e > and 
N e > such that 

Jn U P)>i-e- 

\n>N e PeB(7 h r e ) ) 

As before, we can write this distance as follows: 



K(SMB n ,£ n ) < 

I -i r+oo 



D n ^ k 

n fc=i 



, s k - kh \ £ r -i gfx(v k (-))-kh 



as n — > oo 



Vk J V Vk 
It is now straightforward to get using (|2T|) 



k=l " 



on a set of measure > 1 — q. Since g can be chosen arbitrarily small, the proof 
is finished. 

We now define the return time of a point x to its n-cylinder by 

R n (x) = inf{fc > : f k (x) G T n (x)} . 
The following result is due to Ornstein and Weiss p2>] : 

lim — logi?„(a;) = ha(f) for /i almost alia;. 

n — >oo Ji 

This result needs only the ergodicity of n to hold. 

We define the sequence of weighted empirical (random) measures of the 
logarithm of the return times by 

I n I 

OW n = jr- X] U 6 '(log R k -kh)/Vk ■ 
" k=l 
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Theorem 4.2 Under the hypotheses of Theorem \l.ll and assuming that the 
function log | /' | is Lipschitz, we have 

lim k (OWn.MO, a 2 )) = , (22) 

Lebesgue almost surely. 

The main point is to get a sufficiently strong approximation of log Rk by Sk 
that holds eventually almost surely. This is the subject of Lemma |B . II which is 
stated and proved in Appendix [Bl 

Proof. By the triangle inequality, it is enough to prove that K(OW n ,£ n ) —> 
as n goes to infinity, Lebesgue almost surely. 

By Lemma lB.ll and (|2ip . we obtain the following strong approximation: for 
any < g < 1 and for any i in a set of measure > 1 — g, there is an integer 
N = N(x, g) such that for all n > N 

S k (x)-0{l)logk< log R k (x) < S fc (x)+0(l)logfc. (23) 

We can write this distance as follows (as we have already done before): 

It is straightforward to get using 



asn -> oo 



on a set of measure > 1 — g. Since g can be chosen arbitrarily small, the proof 
is finished. 

We recall that lognormal fluctuations of the return time R n have been stud- 
ied in [S1I2H1E7]. 

A Appendix: Sharp large deviation estimates. 

In this section we will establish a sharp estimate on the probability of large 
deviations. Similar results have been obtained for independent random variables 
and Markov chains. We refer to [41], [26], [25] and [35] for a detailed discussion of 
these cases and more references. For piecewise expanding maps of the interval, 
the result has been obtained by A. Broise [5] under a generic non lacunarity 
assumption. We show here that the result holds in full generality (as in the 
independent case) for small enough values of a. The proof is very similar to 
that of [5]. We give it below for the sake of convenience. 

Lemma A.l Under the assumptions of Theorem ] 1 . SI there is a compact neigh- 
borhood K of the origin, and two constants < c\ < C2 such that for any 
a G A'\{0} and for any integer k > 1 + /3 -4 (recall that (3 — <p'{a) ) we have 

< P ( 5fc > ka \ < _£H_ e -M«) . 
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Proof. 

We first recall a convenient representation of the probability we are interested 
in. For a given a € K, let (3 = f'(a). Define a sequence of positive measures 
Vk, a by 

f 1 ^ e l3S k (x) e -kF(/3) 

dv k , a { s )= d V /> — \ttk\K \ I — h{x)5(s-k 1 (S k (x) - ka)) , 
Jo /*(«)=» l (/ } 

where S denotes the Dirac measure. After a simple computation (see [35] or [5J) 
one gets 

poo 

¥(S k > ka) = e- k ^ / e-^dv k , a {s) ■ 



In this expression of v kt0t , one sees appearing the operator L z given by 

„ (a; ) e ™(*) 

(M(2/)= 1777^1 ■ 

This operator depends analytically on z, and in the space of functions of 
bounded variations, one can use analytic perturbation theory [19] . In particu- 
lar, there is a disk D in the complex plane centered at the origin such that for 
any z inside that disk, the operator L z has a peripheral spectrum consisting of 
a simple eigenvalue denoted below e F ^ z \ In the disk D, F is analytic and the 
corresponding eigenvector and eigenform depend also analytically on z. More- 
over the rest of the spectrum is inside a disk of radius p < 1 and there is a 
uniform bound on the corresponding spectral projection. 

In particular, for a fixed real (3 € D, we have for any complex number z such 
that z + (3 E D 



e* s dv k<a {s) = dy V J hix)eZ (s k ( x )- ka) /Vk 

-oo Jo ,,77 LP Uc) 



f k (x)=y 

»i 



- e -kF(0) e -s/kaz 



( L t3+z/Vk) hh (y)dy 



Note that in the above expression, since u is bounded (and hence the right- 
hand side is an entire function of z. We now apply the analytic perturbation 
theory [19] in the last expression and get 

+oo 

e zs dv k:a {s) = C(fi + z /Vk)e- kF ^ +kF ^ +z ^^ k ^ + R k ((3 + z/Vk) 

-OO 

where Rk(w) is bounded for w E D by 

|ifcM| <0(l)(3p/4) fc 

and C(ui) is analytic for w E D. Note that since can be written as the 
difference of two functions analytic in D, it is also analytic in D and therefore we 
have estimates of all its derivatives in any smaller disk in terms of its maximum 
in D. In particular, we have uniformly in a in a neighborhood of the origin 

duk, a (a) = 0(l)fc- 1/2 
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s 4 d Vk . a {s) = 0{\) 



and 



<j{af = lim / s 2 dv k . a {s) = F"{fi) 

k — >oo J 



As observed in [5], if F"(0) 7^ 0, that is to say u is not of the form v — v o / for 
a function u of bounded variation, then by continuity we have F"(f3) ^ in a 
neighborhood of the origin in (3 and also in a neighborhood of the origin in a. 

From these estimates, we can derive a Berry-Esseen estimate. We denote by 
F k a the law of v k a and by F a the law of the normal distribution with variance 
a 2 '. 

Under the assumptions of Theorem 11.11 there is a number a' Q > and a 
number C > such that for any \a\ < a' we have 

C 

sup \F k . a (x)- F a{a) (x) I < — = . (24) 
x v k 

We refer to [T7] for the proof which uses a standard technique once one has 
adequate control over the characteristic function. See also [39] and [5] for the 
case of dynamical systems. 

The main point of the above result is that the constant C appearing in (|24|) 
is uniform in a. 

To proceed with the proof, we will first give an asymptotic estimate of 

+00 

g(f3s/Vk)dv k ^ a (s) 

-OO 

for fixed C 1 non-negative functions g which tend to zero at infinity and satisfying 
J (s 2 g(s) + |</(s)|) ds < 00. Integrating by parts we have 

-{-OQ />+00 

g((3Vks)dv k , a (s) = -pVk / g'{PsVk)F k , a { S )ds . 

■ 00 J — 00 

(The same holds for the Gaussian measure in place of z/fe, a and F a in place of 
Fk,a-) Using (|24p we get uniformly for a in a neighborhood of zero 

+00 p-\-oo ^ 

g{pVks) dv k , a (s) - / g(f3Vks) dM{0,a 2 )(s) 



+00 1 r~\-oo 



oo v u \nj J -00 



f(p8Vk)F kia (a)da--= / dsg'{(3sVk) / /(2ct(q) } ^ 

'27rcr(a) 



/+00 
ds | 5 '(/3sVfc)| < C(l)fc~ 1/2 
-OO 



Using the hypothesis k > 1 + f3 4 , we obtain 
9 



+00 1 /-+oo 

(pVks)dv kia {s) = — / d S3 ( s ) + 0(l)fc- 1 / 2 . 

The main point of this estimate is that for /3 small enough, the first term on 
the right hand side dominates (see [30] for the exact coefficient in the case of 
independent random variables). 
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The lemma follows by using g(x) = e ~v 1 + x f or an upper bound and for a 
lower bound by using a non negative C 1 function with compact support in the 
interval [1, 2] and bounded above by e~ 2 . 

The following lemma allows to control the probability of having simultane- 
ously S k > ka and S k ° f r > ka. 

Lemma A. 2 Under the assumptions of Theorem \l.l[ there is a number a\ > 
such that for any compact subset D of (— ai, 0)U(0, a\), there are two constants 
C > and c > such that for any a S D, for any integers k > and k > r > 
we have 

V(S k > ka , S k of r > ka) < Ce^ { ^- rc . 

Proof. We first observe that 

P(S fe > ka , S k o f r > ka) < V(S k + S k of r > 2ka) , 

and we will derive a large deviation estimate for this quantity. Using the family 
of operators L z defined above, one gets for any real $ 

E(e^ +S *° r )) = / (Lt L ^ L r.h)( X ) dx. 

From the spectral theory of i z , we get as in the proof of Lemma I A. 1 1 uniformly 
for (3 in a compact set containing the origin in its interior 

E ^( Sk+ Sk ° r )\ < o(l)e 2rF{ P ) e ik ~ r)F< - 2 P ) . 

We now use Chebychev inequality on the left hand side with [3 = ip'(a)/2 and 
obtain 

P(Sfe + S k O f > 2ka) < O{l) e -k^)-r(F(2~0)-2F0)) 

From our assumptions on u (in particular, u / 0), the function F is strictly 
convex, and therefore since -F(O) = we have F(2(3) — 2F([3)) > for any [3 ^ 
in a compact set. The lemma follows. 

B Appendix: Strong approximation of return 
times by the measure of cylinders 

Lemma B.l Let e > 0. Under the assumptions of Theorem \1.1\ we have the 
following: 

— (l + e)logn < log [R n (x)/j,(V n (x))] < loglog(n 1+e ) eventually almost surely. 

Proof. The lemma is the consequence of the exponential law for the distribution 
of the random variables i?„/i(7 : ' n (-)). From [38] there exists a subset of cylinders 
C V satisfying the following conditions: 

• Cn :=/i(7>nVn) with C« < 0(l)e~ Kn , K>0. 
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• For any cylinder C n £ V*, 

sup |/i {R n fi(C n ) > t\ C n ] - e-*\ < Cl e^ n (25) 

where ci,C2 are positive constants, A*{ ' | C«} denotes the conditional 
expectation. 

We want to find a summable upper-bound to 

M {log [R n fl{C n )} > logO < M(Cn)M {log [RnKCn)] > logt| C„} + Cn 

where £ will be chosen as a suitable sequence of positive real numbers. 
Then, from (j2"5")l . one gets for all £ > 



fi{log[R n fi(C n )} > log£} < Cn + m(C„) ( c ^~ C2n 



Take £ = £„ = log(n 1+£ ), e > 0, to get 



/.{logKMOO] > loglog(n 1+e )} < Cl e" C2 " + -L + C„ 



An application of the Borel-Cantclli lemma tells us that 

\og[R n (x)fj,(V n {x))] < loglog(n 1+e ) eventually a.s. . 
For the lower bound first observe that ([23|) gives, for all £ > 

fi{\og[R n fi(C n )} < log£} < C„ + cie- C2 " + 1 - e"* < Cn + cie^ 2 ™ + £ . 
Choose £ = £„ = n~( 1+e - ) , e > 0, to get, proceeding as before, 

\og[R n (x)^i(V n (x))] > —(1 + e) logn eventually a.s. . 
This finishes the proof of the lemma. 

References 

[1] M. Atlagh, M. Weber, Le theoreme central limite presque sur, Expo. Math. 
18 (2000), no. 2, 97-126. 

[2] A. Barbour, R.M. Gerrard, G. Reinert, Iterates of expanding maps, Probab. 
Theory Related Fields 116 (2000), no. 2, 151-180. 

[3] I. Berkes, Results and problems related to the pointwise central limit theo- 
rem. In : Szyszkowicz, B. (Ed.), Asymptotic methods in Probability and 
Statistics (Ottawa, ON, 1997), 59-96, North-Holland, Amsterdam, 1998. 

[4] I. Berkes, E. Csaki, A universal result in almost sure central limit theory, 
Stochastic Process. Appl. 94 (2001), no. 1, 105-134. 



22 



[5] A. Broise, Transformations dilatantes de I'intervalle et theoremes limites. 
In : Etudes spectralcs d'operateurs de transfert et applications. Asterisque 
1996, no. 238, 1-109. 

[6] G. Brosamler, An almost everywhere central limit theorem, Math. Proc. 
Cambridge Phil. Soc. 104, 561-574 (1988). 

[7] J. Buzzi, Specification on the interval, Trans. Amer. Math. Soc. 349 (1997), 
2737-2754. 

[8] P. Collet, Some ergodic properties of maps of the interval, Dynamical Sys- 
tems (Temuco, 1991/1992), 55-91, Travaux en cours 52, Herman, Paris, 
1996. 

[9] P. Collet, A. Galves, B. Schmitt, Fluctuations of repetition times for Gibb- 
sian sources, Nonlinearity 12 (1999), 1225-1237. 

[10] P. Collet, J. -P. Eckmann, Liapunov multipliers and decay of correlations 
in dynamical systems, J. Statist. Phys. 115 (2004), no. 1-2, 217-254. 

[11] P. Collet, S. Isola, On the essential spectrum of the transfer operator for 
expanding Markov maps, Comm. Math. Phys. 139 (1991), no. 3, 551-557. 

[12] P. Collet, S. Martinez, B. Schmitt, Exponential inequalities for dynamical 
measures of expanding maps of the interval, Probab. Thcor. Rcl. Fields 
123, 301-322 (2002). 

[13] F. Comets, Erdds-Renyi laws for Gibbs measures, Comm. Math. Phys. 162 
(1994), no. 2, 353-369. Detecting phase transition for Gibbs measures, Ann. 
Appl. Probab. 7 (1997), no. 2, 545-563. 

[14] P. Deheuvels, L. Devroye and J. Lynch, Exact convergence rate in the limit 
theorems of Erdds-Renyi and Shepp, The Annals of Probability, 14, 209-223 
(1986). 

[15] M. Denker, The central limit theorem for dynamical systems. In : Dynam- 
ical Systems and Ergodic Theory (Varsaw, 1986), 33-62, Banach Center 
Publications 23, PWN, Warsaw, 1989. 

[16] P. Erdos, G.A. Hunt, Changes of signs of sums of random variables, Pacific 
J. Math. 3, 673-687, 1953. 

[17] W. Feller, An introduction to probability theory and its applications. Vol. 
I & II. Second edition John Wiley & Sons, Inc., New York-London-Sydney 
1971 

[18] A. Fisher, Convex-invariant means and a pathwise central limit theorem, 
Adv. in Math. 63 (1987), no. 3, 213-246. 

[19] T. Kato, Perturbation theory for linear operators. Reprint of the 1980 edi- 
tion. Classics in Mathematics. Springer- Verlag, Berlin, 1995. 

[20] I. Gikhman, A. Skorokhod, Introduction to the theory of random processes. 
Dover, New York, 1996. 



23 



[21] A. Gibbs, F. Su, On choosing and bounding probability metrics, Interna- 
tional Statistical Review (2002) 70, no. 3, 419-435. 

[22] V.M. Gundlach, Y. Latushkin, A sharp formula for the essential spectrum 
of Ruelle transfer operator on smooth and Holder spaces, Ergodic Theory 
Dynam. Systems 23 (2003), no. 1, 175-191. 

[23] F. Haufbauer, G. Keller, Ergodic properties of invariant measures for piece- 
wise monotonic transformations, Math. Z. 180, 119-140 (1982). 

[24] LA. Ibragimov, Some limit theorems for stationary processes, Theory 
Probab. Appl. VII, no. 4, 1962, pp. 349-382. 

[25] M. litis, Sharp asymptotics of large deviations in R d , J. of Th. Prob. 8, 
501-522 (1995). 

[26] I. Iscoe, P. Ney and E. Nummelin, Large deviations of uniformly recurrent 
Markov additive processes, Adv. in Appl. Math. 6, 373-412 (1985). 

[27] A.N. Kolmogorov, V.M. Tikhomirov. e-Entropy and e-capacity of sets in 
functional spaces. Reprinted in Selected works of A.N. Kolmogorov, Volume 
III. A.N. Shiryayev editor, Dordrecht Kluver, 1993. 

[28] I. Kontoyiannis, Asymptotic recurrence and waiting times for stationary 
processes, J. Theor. Probab. 11 (1998), 795-811. 

[29] M. Lacey, W. Philipp, A note on the almost sure central limit theorem, 
Statist. Probab. Lett. 9, no. 3, 201-205 (1990). 

[30] A. Lasota, J. Yorke, On the existence of invariant measures for piecewise 
monotonic transformations, Trans. Amer. Math. Soc. 186 481-488 (1974). 

[31] F. Lcdrappicr, Some properties of absolutely continuous invariant measures 
on an interval, Ergod. Th. & Dynam. Syst. 1, no. 1, 77-93, 1981. 

[32] E. Lesigne, Almost sure central limit theorems for strictly stationary pro- 
cesses, Proc. Amer. Math. Soc. 128 (2000), no. 6, 1751-1759. 

[33] C. Liverani, Decay of correlations for piecewise expanding maps, J. Stat. 
Phys. 78 (1995), 1111-1129. 

[34] G.G. Lorentz, M.V. Golitschek, Y. Makovoz. Constructive Approximation. 
Advanced problems. Grundlehren der Mathematischen Wissenschaften 
304. Springer- Verlag, Berlin, 1996. 

[35] P. Ney, Notes on dominating points and large deviations, Resenhas IME- 
USP 4, 79-91 (1999). 

[36] D. Ornstein, B. Weiss, Entropy and data compression schemes, IEEE Trans. 
Inform. Th. 39 (1993), 78-83. 

[37] F. Paccaut, Statistics of return times for weighted maps of the interval, 
Ann. Inst. H. Poincare, Probab. & Stat. 36 (3) (2000), 339-366. 



24 



[38] F. Paccaut, Proprietes statistiques de systemes dynamiques non 
markoviens. PhD thesis of the University of Bourgogne, Dijon, France, 
2000. 

[39] Z. Coelho, W. Parry, Central limit asymptotics for shifts of finite type, 
Israel J. Math. 69 (1990), no. 2, 235-249. 

[40] V. Pctrov, Limit theorems of probability theory. Sequences of independent 
random variables. Oxford Studies in Probability, 4. Oxford Science Publi- 
cations. The Clarendon Press, Oxford University Press, New York, 1995. 

[41] V. Petrov, Asymptotic behavior of the probabilities of large deviations, 
Theor. Probability Appl. 13 (1968), 408-420. 

[42] S.T. Rachev, Probability metrics and the stability of stochastic models. Wi- 
ley Series in Probability and Mathematical Statistics: Applied Probability 
and Statistics. John Wiley & Sons, Ltd., Chichester, 1991. 

[43] E. Rio, Theorie asymptotique des processus aleatoires faiblcment 
dependants. Mathematiques & Applications 31, SMAI, Springer, 2000. 

[44] A. Rouault, M. Yor and M. Zani, A large deviation principle related to the 
strong arc-sine law, J. Theor. Probab. 15 (3), 793-815, 2002. 

[45] P. Schatte, On strong versions of the central limit theorem, Math. Nachr. 
137, 249-256 (1988). 

[46] K. Zicmian, Refinement of the Shannon- McMillan- Breiman theorem for 
some maps of an interval, Studia Math. 93 (1989), no. 3, 271-285. 



25 



